o   The "linkage" of information in the sense of ensuring comparability between
items in different surveys, in various non-survey databases
(administrative records), or in the same survey over time; and

o   The "linkage" of data from different sources, such as from surveys and
various other federal databases, to obtain more complete information using
existing data.

Data linkage in the first sense is referred to by Hawley, who discusses the
"integration and enhancement of existing data" and recommends the nesting of
"future studies in such a way that data from the same sites could be
integrated." He suggests that "it might be possible to conduct the High School
and Beyond (HSB) and National Assessment studies in the same or overlapping
locations." He also suggests the connection of school process studies "in some
way to the outcomes being studied in the NAEP."

Also with reference to data linkage in the first sense, Plisko, Ginsburg and
Chaikind make the following observation: "In particular, we need to examine
whether the division between repeated cross-sectional studies, such as NAEP, and
longitudinal studies, such as HSB are real or artificial distinctions." They
outline the possibility of attaching a small longitudinal component to the NAEP
7th grade sample and conducting a follow-up in two years to provide "some
measure of the extent of attrition at this early level."

Data linkage in the second sense involves comparability, which is discussed
elsewhere in this paper and will not be discussed further here.

Data linkage in the third sense is often quite feasible but presents a number of
technological problems in database organization, algorithm design for record
matching, and data processing efficiency.
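
As an illustration of the record-matching problem mentioned above, the following
is a minimal sketch of exact-key matching between a survey file and an
administrative file. It is not drawn from any of the studies cited here: the
function names (normalize, link_records) and the field names (name, birth_date,
test_score, loan_amount) are hypothetical, and the sketch assumes both files
carry a name and a birth date that can serve as a common key.

```python
from collections import defaultdict


def normalize(name, birth_date):
    """Build a match key: letters of the name in lowercase, plus birth date.

    Hypothetical rule chosen for illustration; real files need agreed-upon
    identifiers and more careful standardization.
    """
    cleaned = "".join(ch for ch in name.lower() if ch.isalpha())
    return (cleaned, birth_date)


def link_records(survey_rows, admin_rows):
    """Attach administrative fields to survey rows that match exactly one
    administrative record; return (linked, unmatched)."""
    # Index the administrative file once so each survey lookup is a single
    # dictionary access rather than a scan of the whole file.
    index = defaultdict(list)
    for row in admin_rows:
        index[normalize(row["name"], row["birth_date"])].append(row)

    linked, unmatched = [], []
    for row in survey_rows:
        candidates = index.get(normalize(row["name"], row["birth_date"]), [])
        if len(candidates) == 1:
            # Survey values take precedence where field names collide.
            linked.append({**candidates[0], **row})
        else:
            # No match, or an ambiguous match: leave for separate review.
            unmatched.append(row)
    return linked, unmatched


# Illustrative records only; the values are invented for this example.
survey = [
    {"name": "Ann O'Neil", "birth_date": "1968-03-02", "test_score": 71},
    {"name": "Bob Ray", "birth_date": "1967-11-30", "test_score": 64},
]
admin = [
    {"name": "ann oneil", "birth_date": "1968-03-02", "loan_amount": 2500},
]
linked, unmatched = link_records(survey, admin)
```

Indexing one file by the match key bears on the processing-efficiency concern,
and setting aside ambiguous or missing matches reflects the fact that exact
keys alone rarely resolve every record.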

Data linkage in the third sense is especially relevant to Federal student
financial aid programs. As Plisko, Ginsburg and Chaikind put it:

"Documenting the trends in aid recipients...has proved difficult and
controversial. However, the problem is not that there is insufficient
information, but that different data sets are disjointed and cannot describe
the total student aid package in relation to student financial need. Hence,
this problem is one of linkage rather than coverage. ..."

It is worth noting that High School and Beyond data are currently being linked
to Federal Guaranteed Student Loan (GSL) and Pell files, so significant
developments in the technological capabilities for data linkage are already
underway. Nevertheless, there are other sources of data which could be
considered for linkage purposes, and it is still true that "much of the
information useful for analyzing the higher education process is piecemeal,
serving only the specific needs of the originating agency."

While further development of linkage technology will not entirely remedy this
fragmentation, it has the potential to produce major benefits.

In a similar vein, W. Turnbull refers to "merging" various datasets, perhaps
using a system of planned "linking sections." He seems to be referring to